Integrating Multiple Classifiers In Visual Object Detectors Learned From User Input
نویسندگان
چکیده
There have been many recent efforts in contentbased retrieval to perform automatic classification of images/visual objects. Most approaches, however, have focused on using individual classifiers. In this paper, we study the way in which, in a dynamic framework, multiple classifiers can be combined when applying Visual Object Detectors. We propose a hybrid classifier combination approach, in which decisions of individual classifiers are combined in the following three ways: (1) classifier fusion, (2) classifier cooperation, and (3) hierarchical combination. In earlier work, we presented the Visual Apprentice framework, in which a user defines visual object models via a multiple-level object-definition hierarchy (region, perceptual-area, object part, and object). As the user provides examples from images or videos, visual features are extracted and multiple classifiers are learned for each node of the hierarchy. In this paper, we discuss the benefits of hybrid classifier combination in the Visual Apprentice framework, and show some experimental results in classifier fusion. These results suggest possible improvements in classification accuracy, particularly of detectors reported earlier for Baseball video, images with skies, and images with handshakes.
منابع مشابه
Learning Structured Visual Detectors from User Input at Multiple Levels
In this paper, we propose a new framework for the dynamic construction of structured visual object/scene detectors for content-based retrieval. In the Visual Apprentice, a user defines visual object/scene models via a multiple-level definition hierarchy: a scene consists of objects, which consist of object-parts, which consist of perceptual-areas, which consist of regions. The user trains the s...
متن کاملAutomatic selection of visual features and classifiers
In this paper, we propose a dynamic approach to feature and classifier selection. In our approach, based on performance, visual features and classifiers are selected automatically. In earlier work, we presented the Visual Apprentice, in which users can define visual object models via a multiple-level object definition hierarchy (region, perceptual-area, object-part, and object). Visual Object D...
متن کاملOnline multiple people tracking-by-detection in crowded scenes
Multiple people detection and tracking is a challenging task in real-world crowded scenes. In this paper, we have presented an online multiple people tracking-by-detection approach with a single camera. We have detected objects with deformable part models and a visual background extractor. In the tracking phase we have used a combination of support vector machine (SVM) person-specific classifie...
متن کاملIntegrating object detectors
ace detector, object detector, integration, expected computational cost, cascade of classifiers, decision tree This paper describes a method for integrating object detectors that reduces the expected computational cost of evaluating all the detectors whilst obtaining the same logical behaviour as running the detectors independently. The method combines the decision trees of the different object...
متن کاملBayesian learning for weakly supervised object classification
We explore the extent to which we can exploit interest point detectors for representing and recognising classes of objects. Detectors propose sparse sets of candidate regions based on local salience and stability criteria. However, local selection does not take into account discrimination reliability across instances in the same object class, so we realise selection by learning from weakly supe...
متن کامل